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1. Introduction 

The statistical analysis of shape distributions based on random samples is important 
in many areas such as morphometries (discrimination and classification of biolog- 
ical shapes), medical diagnostics (detection of change or deformation of shapes in 
some organs due to some disease, for example) and machine vision (e.g., digital 
recording and analysis based on planar views of 3-D objects). Among the pioneers 
on foundational studies leading to such applications, we mention Kendall [20] (also 
see Kendall et al. [21]) and Bookstein [9]. The geometries of the spaces are those of 
differentiable manifolds often with appropriate Ricmannian structures. 

'Supported by NSF Grant DMS 04-06-143. 

^Department of Mathematics, University of Arizona, Tucson, AZ 85721, USA, e-mail: 
abhishekOmath. arizona. edu; rabiOmath. arizona. edu 

AMS 2000 subject classifications: Primary 62G20; secondary 62E20, 62H35. 

Keywords and phrases: extrinsic and intrinsic means and variations, Kendall's shape spaces, 
two-sample nonparametric tests. 



282 



Nonparametric statistics on manifolds 



283 



Our goal in this article is to establish some general principles for nonparametric 
statistical analysis on such manifolds and apply those to some shape spaces, es- 
pecially Kendall's two-dimensional shape space of the so-called k-ads, i.e., the 
space of configurations of k points on the plane (not all identical), identified mod- 
ulo size and under Euclidean motions of translation and rotation. Two sample tests 
for the comparison of both extrinsic and intrinsic Frcchct mean shapes and mean 
variations of two distributions on S| are provided. As far as we know the explicit 
computations of these tests are new. In the case of the intrinsic mean and variation, 
the usual support criterion (see, e.g., Le [24] and Bhattacharya and Patrangenaru 
[G, 7, 8]) is significantly relaxed, thereby substantially enhancing the applicability 
of the tests. 

For recent results on statistical analysis of 3-D shapes, which we do not consider 
here, we refer to Dryden et al.[ll] and Bandulasiri et al. [2]. 

Sometimes the sample sizes in shape analysis are only moderately large. Under 
such circumstances, one may more effectively use Effron's bootstrap methods (Ef- 
fron [14]), whose superiority over the classical CLT-based confidence regions and 
tests may be established via higher order asymptotics (see, e.g., Babu and Singh 
[1], Bhattacharya and Qumsiyeh [^], Bhattacharya and Ghosh [4], Ghosh [16], Hall 
[17]). 

We next turn to the specific example of main interest to us, namely, E^- F^^i' 
purposes of medical diagnostics, classification of biological species, etc., one may 
use expert help to choose a suitable ordered set of k points or landmarks in the 
plane, or a fc-arf, 

z = {{xj,yj),l <j< fc}, 

on a two-dimensional image of an object under consideration. One assumes that 
not all k points are the same, and k > 2. Kendall's shape space comprises the 
equivalence classes of all such k-ads under translation, rotation and scaling. For 
a given k-ad z, the effect of translation is removed by considering z — (z) where 
(z) is the vector whose elements are all equal to the mean location of the k-ad, 
namely, (1/fc) J2j=ii^j ^Vj)- The translated k-ads then lie in the (2fc— 2)-dimensional 
hyperplane H of (SR^)'^' w 3?^*^', given by 

H = {ixj,yj)i<j<k ■■ Xj = 0, ^ Vj = 0}, 

and they comprise all of H except the origin. The effect of scale, or length, is 
removed by dividing z — (z) by i|z — (z)|| where ]|.]| is the usual Euclidean norm in 

\\{u,,v,),<,<u\\ = [Y^{u]+v^)]^'\ 

The resulting transformed k-ad w = (z — (z))/]|z — (z)|| is called the preshape of the 
k-ad z. The set of preshapes is then naturally identified with the unit sphere in H, 
which is basically the same as the unit sphere S'^''~^ in 3?^'^"^. Finally, the shape 
[z] of a k-ad z is given by the orbit of w = {uj, Wj)i<j<fc under rotation, namely. 



(1.1) 



cos — sm 6 
sin 9 cos 9 



i<j<k 



-n < 9 < t:. 



Thus T,2 is a quotient space of S'^'"' ^, namely, S'^'^ ^/S^, and it has dimension 
2fc-4. 

We will use a mathematically more convenient way of describing achieved 
by viewing a k-ad as an element of C*"', namely, z = {xj + iyj)i<j<k- Then (z) is 
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the complex fc- vector whose elements are all equal to (1/fc) X]j=i(^j + Wj)- The 
translated k-ad then lies in the complex (fc — l)-dimensional hyperplane 

k 

H"-^ = {(«j)i<j<fe e : = 0}- 

The norm ||z — (z)j| has the same value as before. But the rotation by an angle 
6' of w = (z — (z))/||z — (z)|| may now be expressed as e**w. For a system of 
coordinate neighborhoods, or parametrization, of this spherical representation of 
as a quotient space of S^''~^, see Gallot et al. ([15], pp. 32, 34). 
Another parametrization of S| , compatible with the above, is obtained by view- 
ing the shape of a k-ad z = {xj + iyj)i<j<k as the orbit 

{zo(z-(z)) : zoeC\{0}}. 

Note that zq = Ae'^ for A = \zo\ and some 6 e (— tt, tt]. so that the orbit, namely, a 
complex line through the origin in H^^^ , is independent of both scale and rotation 
and, therefore, a representation of the shape of z. Thus S]| is (isomorphic to) the 
space of all complex lines through the origin in C*^^^, the complex projective space 
(j~>pfc-2^ a familiar and important example in differential geometry. For a system of 
coordinate neighborhoods for S]| viewed as CP'^"^, see Gallot et al. ([15], pp. 9, 
10, 64, 65). 

We next consider an extrinsic distance on corresponding to a special embed- 
ding, namely, the Veronese- Whitney embedding 4>e of E2 i'^to the space S{k,C) of 
k X k complex Hermitian matrices: 

(1.2) (/)£;([z]) = WW* 

where w = (z — (z))/||z — (z) || is the preshape of z. Here w is regarded as a column 
vector of k complex numbers, w = (wi,W2, ■ ■ ■ ,WkY, and w* is the transpose of 
its complex conjugate. Observe that the right side of (1.2) is constant on the orbit 
{e^^w : — TT < 6* < tt} of the preshape w and is, therefore, a function of the shape 
[z] of the k-ad. Also, this function is one-to-one on into S{k,C). The vector 
space S{k, C), with the real scaler field 3?, has dimension k^. This is because a. kx k 
Hermitian matrix is specified by k real numbers on the diagonal and (2) complex 
numbers (i.e., 2(2) real numbers) as lower-right off-diagonal elements. On S{k,C) 
define the norm ||.|| and distance d by 

PIP = Trace A^* = TraceA^, 

(1.3) d^{A,B) = = Trace(^-B)2. 

Note that this is the same as the Euclidean norm and distance in 3?^'^ . The induced 
distance pE on is then given by 

pl{[z],[v^]) = d\MM),M[^])) 

(1.4) =2-2|u*vp 

where u and v are the preshapes of [z] and [w] respectively. The distance pE is 
known as the full Procrustes distance (Kendall [22], Kent [23] and Dryden and 
Mardia [12]). 



= Tracc(uu* - vv*)^ 

fe fe 



Nonparametric statistics on manifolds 



285 



Let Xj, 1 < j < 71, be i.i.d. k-ads such that their shapes [Xj], 1 < j < n, have the 
common distribution Q. Let p, denote the Euchdean mean of Q = Q°<Pe^ viewed as 
a probabihty measure on S{k, C). Let AI ~ and denote the Euchdean pro- 

jection of p, on M by Pp, = Pjyjp- The extrinsic mean of Q is then /ig = (j>~^{Pp). 
It minimizes the Frechet function (2.1) with respect to the distance pE- Similarly, 

for the sample extrinsic mean, calculate PX where X — (l/?T-)X]j=i 't'E{[Xj]) is a 
coordinate- wise average of the matrix elements WjW* and Wj is the preshape of 

Xj (1 < j < «)• The asymptotic distribution of -Jn{PX — Pp) is given by that of 
its projection on the tangent space Tpf^M at P/i, since its projection on the com- 
plement of TpjiM is negligible. For computation of this projection, one chooses a 
suitable orthonormal basis of S{k, C) (considered as a single orthonormal frame for 
its constant tangent spaces), and calculates the differential of the projection map 
P = Pj^j : S{k, C) — > M in terms of these coordinates. One thus arrives at a non- 
singular (2A: — 4)-dimensional Normal distribution in the limit (see Sections 3.1-3.4 
for details). 

Turning to the intrinsic mean on a Riemannian manifold M, with geodesic 
distance dg, the first problem to resolve is its existence as the unique minimizer of 
the Frechet function / d^{p, m)Q{dm). Here a result of Karchar [19] on the existence 
of a unique minimizer is greatly improved by a result of Kendall [22], which allows 
the radius r of a geodesic ball B{p, r) containing the support of Q to be twice as large 
as required by Karchar [19] (Proposition 4.1). On such a ball, the map (j) = exp~^ 
(the inverse of the exponential map at p), is a diffeomorphism onto its image in 
the tangent space TpM at p. Using the coordinates of the vector space TpM, called 
normal coordinates, one arrives at a central limit theorem for the sample intrinsic 
mean /x„/ (Theorem 4.2), following Bhattacharya and Patrangenaru [8]. Note that, 
with the (non-Euclidean) distance on TpM induced by ^ from the geodesic distance 
dg on M, the image fXn = 4'ilJ'ni) of fXni is the minimizer of the Frechet function 

Fnix) = J dl{(p-'x,(j>-'y)Qn{dy) 

where Qn = Qn ° 4>^^, Qn = (V") '^[Xj]- Thus /i„ is a M-estimator in the 

Euclidean space TqM. The assumptions in Theorem 4.2 guarantee that this M- 
estimator is asymptotically Gaussian around fi = (j){pLi). The asymptotic distribu- 
tion of the test statistic (4.5) follows from this. 

The computation of the test statistic (4.5) is generally more involved than that 
used for comparing extrinsic means (see, e.g., (3.17) for the case M = S2). This 
involves, in particular, the metric tensor of M to compute geodesies and normal 
coordinates. We refer to [■]] for the asymptotic theory for intrinsic means, with 
explicit computations of parameters especially for the planer shape space of le- 
ads. However in Section 5 of the present article, we display numerical values of 
the intrinsic two-sample test statistics, along with the corresponding p-values, in 
two examples. It may be noted that for highly concentrated data in each of these 
examples, the extrinsic and intrinsic distances are close and hence the extrinsic and 
intrinsic test statistics have virtually the same values. 

The minimum value attained by the Frechet function is called the Frechet vari- 
ation of Q and it is a measure of spread of the distribution Q. The sample Frechet 
variation is a consistent estimator of the Frechet variation of Q as proved in Propo- 
sition 2.4. If the Frechet mean exists, we derive the asymptotic distribution of the 
sample Frechet variation in Theorem 2.5. This can be used to construct a nonpara- 
metric test statistic to compare the spread of two populations on M . We compute 
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numerical values of the test statistic, along with the p-values for M £3 in Sec- 
tion 5. For highly concentrated data as in the examples considered in Section 5, the 
Frechet variations of the distributions are very small. Then the mean comparison is 
usually sufficient to discriminate between the populations and the variations show 
no significant difference. 

We conclude this section with two brief remarks. First, the main objective of 
inference in the two-sample problem on I]| is to discriminate between two different 
distributions on it. It turns out, in most practical problems that arise, that the 
means and variations (extrinsic or intrinsic) are generally adequate for this dis- 
crimination. More elaborate procedures such as nonparametric density estimation 
suffer from the "curse of dimensionality" on this commonly high-dimensional space. 
One can, however, do such density estimation on a tangent space (e.g., on T^^M, 
via the inverse exponential map exp~^^), as in the Euclidean case. Excepting for the 
computation in normal coordinates, this presents no novelty. Secondly, in examples 
with real data sets that we have studied (e.g., those in Section 5), the p-values of 
the nonparametric two-sample tests for comparing means, developed in this arti- 
cle, are always much smaller (often by an order of magnitude or more) than those 
based on existing, mostly parametric, tests in the literature (see Dryden and Mar- 
dia [12]). This seems to indicate that the tests proposed here may be more powerful 
than those that have been used in the past, for many data sets that arise in prac- 
tice. This perhaps also points to the inadequacy of parametric models of shapes 
popularly used in the literature in capturing certain important shape features. 

2. Prechet mean and variation on metric spaces 

Let (A/, p) be a metric space, p being the distance on M. For a given probability 
measure Q on (the Borel sigma-field of) M, define the Frechet function of Q as 



2.1. Frechet mean 

Definition 2.1. Suppose F{p) < 00 for somcp e AI. Then the set of allp for which 
F{p) is the minimum value of F on M is called the Frechet mean set of Q, denoted 
by Cq. If this set is a singleton, say {/xf}, then fj,p is called the Frechet mean 
of Q. If Xi,X2, ■ ■ ■ ,Xn are independent and identically distributed (i.i.d.) with 
common distribution Q, and Q„ = (l/'^)X]j"=i is the corresponding empirical 
distribution, then the Frechet mean set of Qn is called the sample Frechet mean 
set, denoted by Cq„. If this set is a singleton, say {/iF„}, then yUi?„ is called the 
sample Frechet mean. 

The following result has been proved in Theorem 2.1, Bhattacharya and Pa- 
trangenaru [7]. 

Proposition 2.1. Suppose every closed and bounded subset of M is compact. If the 
Frechet function F{p) of Q is finite for some p, then Cq is nonempty and compact. 

The next result establishes the strong consistency of the sample Frechet mean. 
For a proof, see Theorem 2.3, Bhattacharya and Patrangenaru [7]. 



(2.1) 
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Proposition 2.2. Assume (i) that every closed bounded subset of M is compact, 
and (ii) F is finite on M. Then given any e > 0, there exists an integer valued 
random variable N = N(iu, e) and a P-null set A{uj^ e) such that 



outside of A[ijj,e). In particular, if Cq = {mf}, then every measurable selection 
fip^ from Cq_^ is a strongly consistent estimator of i^ip. 

Remark 2.1. It is known that a connected Ricmannian manifold M wliicli is 
complete (in its geodesic distance) satisfies the topological hypothesis of Proposi- 
tions 2.1 and 2.2: every closed bounded subset of M is compact (see Theorem 2.8, 
Do Carmo [10], pp. 146-147). We will investigate conditions for the existence of the 
Frechet mean of Q (as a unique minimizer of the Frechct function F of Q) in the 
subsequent sections. 

Remark 2.2. One can show that the reverse of (2.2), that is, "Cq C Cq V ti > 
N{u!, e)" does not hold in general. See, for example, Bhattacharya and Patrangenaru 
([7], Remark 2.6). 

Next we consider the asymptotic distribution of fip^. For Theorem 2.3, we assume 
M to be a differentiable manifold of dimension d. Let p be a distance metrizing the 
topology of M. For a proof of the following result, see Theorem 2.1, Bhattacharya 
and Patrangenaru [8]. 

Theorem 2.3. Suppose the following assumptions hold: 

(i) Q has support in a single coordinate patch, {U,(j)), (p ■ U — > smooth. 
LetYj =,P{X,), J = l,...,n. 

(ii) The Frechet mean fip of Q is unique. 

(iii) \fx, y 1-^ h(x,y) — p'^{(j)~^x,4>~^y) is twice continuously differentiable in a 
neighborhood of (j)(fiF) — fi. 

(iv) E(Drh(Yi,fi))^ < oo Vr. 

(v) £■( sup \I)sI)rh{Yi,v) -I)sI)rh{Yi,u)\) ^ as e ^ Oy r,s. 

\u—v\<e 

(vi) A = E {DsDrh(Yi, fi)) is nonsingular. 

(vii) S ~ Gov (D/i(Yi, /i)) is nonsingular. 

Let be a measurable selection from the Frechet sample mean set, and write 
H„ — (l){fip^). Then under the assumptions (i)-(vii), 



2.2. Frechet variation 

Definition 2.2. The Frechet variation of Q is the minimum value attained 
by the Frechet fimction F defined by (2.1) on AI. Similarly the minimum value 
attained by the sample Frechet function, 



(2.2) 



Cq„ cC'Q = {peM: p{p,Cq) < e}, \fn>N 



(2.3) 



V^i^in - m) ^ ^(0, a-ie(a')-I). 



(2.4) 




is called the sample Frechet variation and denoted by 
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From Proposition 2.1 it follows that if the Frcchet function F{p) is finite for some 
p, then V is finite and equals F{p) for all p in the Frechet mean set Cg. Similarly 
the sample variation Vn is the value of Fn on the sample Frechet mean set Cg„. 
The following result establishes the strong consistency of Vn as an estimator of V. 

Proposition 2.4. Suppose every closed and bounded subset of M is compact, and 
F is finite on M. Then Vn is a strongly consistent estimator of V . 

Proof. In view of Proposition 2.2. for any e > 0, there exists N ~ N{uj,e) such that 
(2.5) \Vn -V\ = \ MFnip) - inf < sup |F„(p) - Fip)\ 

peM p&M 

for all n > N almost surely. From the proof of Theorem 2.3 in Bhattacharya and 
Patrangenaru [7], it follows that for any compact set K C M, 

sup|F„(p) — F{p)\ — > a.s. as n — > oo. 

Since Cg is compact, it follows from (2.5) that 

\Vn — V\ — > a.s. as n oo. □ 

Remark 2.3. The sample variation is a consistent estimator of the population 
variation even when the Frechet function F oi Q does not have a unique minimizer. 



Next we derive the asymptotic distribution of Vn when there is a unique popu- 
lation Frechet mean. 

Theorem 2.5. Let M be a differentiate manifold. Using the notation of Theo- 
rem 2.3, under assumptions (i)-(vii) and assuming E{p'^{Xi, ^p)) < oo, one has 

(2.6) V^iVn -V)^N (0, var(p2(Xi, ^j.))) . 
Proof. Let 

r 1 " 

F{x) = / p^{^~\x),m)Q{dm), F„(x) ^ - J2p^{^~H^),X,). 

i=i 

Let fip^ be a measurable selection from the sample mean set and /x„ — (j^dipn)- 
Then 

V^iVn -V)= V^iFnifln) - F(/i)) 

(2.7) = V^iFniPn) - Fn{p)) + V^{Fn{fl) - F{fl)), 

^ 71 d 

Vn{Fn{fin) - FnifJ.)) = —^^^il^n - fJ.) r HYj , fl) 

_^ n d d 
V j^i r=l s=l 

for some /i* in the line segment joining fi and fin- By assumption (v) of Theorem 2.3 
and because ^/n{pn ^ m) is asymptotically normal, the second term on the right of 
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(2.8) converges to in probability. Also (l/n) ^"^^ I)h(Yj,^i) E (D/i(Yi, ^)) = 0, 
so that the first term on the right of (2.8) converges to in probability. Hence (2.7) 
becomes 

V^iVn -V) = V^(K(/^) - Fill)) + Op(l) 

1 " 

(2.9) = --=^(p2(X,,AiF) -Ep2(Xi,/i;^)) +Op(l). 

By the CLT for the i.i.d. sequence {p'^{Xj, ^p)}, (2.9) converges in law to 
N{0,YSir{p^Xul^F))- □ 

Remark 2.4. Theorem 2.5 requires the population mean to exist for the sample 
variation to be asymptotically Normal. It may be shown by examples that it fails 
to give the correct distribution if there is not a unique mean. 

Theorem 2.5 can be used to construct a nonparametric test for testing whether 
two populations have the same spread. Suppose Qi and Q2 are two probability 
distributions with unique Frechet means fj,ip and ^2F and Frechet variations Vi 
and V2, respectively. We have i.i.d. samples Xi,X2, ■ ■ ■ , X^ and Yi, F2, ■ • ■ , from 
Qi and Q2, respectively. Let jiPn and pLp^ denote the sample means, Vn and Vm 
denote the sample variations. Then the null hypothesis is 

Ho:Vi=V2 = V. 

Under Hq, from (2.6), 

(2.10) ^/^{Vn-V) ^ N{Q,nl), 

(2.11) V^(Kn -V)^ N{0, al), 

where al = var(p^(Xi, /iip)), al = var(p^(Yi, //2f))- 

Suppose nKra + n) — > m/ (m + n) q, for some p,q > 0; p + q — 1. Then from 
(2.10) and (2.11), 

(2.12) V^TTTKiVn - V,n) NiO, ^ . '^2 



(2.13) ^" ^A^(0,1), 



q 



where sf = {l/n)Y,%^{p'^{Xj, ^ip^^) - K)^ and si = (l/m) i(/0^(^j.Mf„) - 
Vm)'^ are the sample estimates of af and (t|, respectively. Hence the test statistic 
used is 

(2.14) r„„= 



For a test of size a, we reject Hq if |T„„i| > ^i_(q/2) where .^1-^(0/2) is the (1 — 
(a/2))'h quantile of A^(0,1). 

From now on, unless otherwise stated, we assume that (M, g) is a d-dimensional 
connected complete Riemannian manifold, g being the Riemannian metric tensor 
on M. We shall come across different notions of means and variations depending on 
the distance chosen on M . Wc begin with the extrinsic distance in the next section. 
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3. Extrinsic mean and variation 

Let <j>: M ^n'' he an embedding of M into SR'^', and let M = 0(Af ) C SR*-'. Define 
the distance on M as: p{x,y) = — (l){y)\\, where || • || denotes EucHdean norm 

(II'^IP = = (mij 1*2, . . . , Ufe)'). This is called the extrinsic distance on 

M. 

Assume that M is a closed subset of 3?^^. Then for every u G 3?^^ there exists a 
compact set of points in M whose distance from u is the smallest among all points 
in M. We will denote this set by 

Pu = Pj^jU ^{xe M : \\x - u|| < \\y - u\\ Vy G M}. 

If this set is a singleton, u is said to be a nonfocal point of di'' (with respect to M); 
otherwise it is said to be a focal point of 3?'^. 

Definition 3.1. Let {M,p), (j) be as above. Let Q be a probability measure on M 
with finite Frechet function. The Frechet mean (set) of Q is called the extrinsic 
mean (set) of Q, and the Frechet variation of Q is called its extrinsic variation. 
If Xj (j = l,...,n) are iid observations from Q, and Q„ = ^ X]j=i '^^i ^^i^ 
empirical distribution, then the Frechet mean(set) of Qn is called the extrinsic 
sample mean{sct) and the Frechet variation of Qn is called the extrinsic sample 
variation. 

Let Q and Q„ be the images of Q and Q„, respectively, on 3?*^ under (p: Q = 
Q o 4>^^ , Qn = Qn o The next result gives us a way to calculate the extrinsic 
mean and establishes the consistency of the sample mean as an estimator of the 
population mean if that exists. For a proof see Proposition 3.1 in Bhattacharya and 
Patrangenaru [7]. 

Proposition 3.1. (a) If jl ~ J^^k uQ{du) is the mean of Q, then the extrinsic mean 
set of Q is given by (f)''^{Pji). (b) If jl is a nonfocal point of^^ (relative to M), then 
the extrinsic sample mean finE (o-^V measurable selection from the extrinsic mean 
set of Qn) is a strongly consistent estimator of the extrinsic mean pE = 4'~^iPf^)- 

3.1. Asymptotic distribution of the sample extrinsic mean 

We can use Theorem 2.3 to get the asymptotic distribution of the sample extrinsic 
mean. However, expressions for the parameters A and E are not easy to get. Here 
we devise another way to derive the asymptotic distribution. We assume that the 
mean fl of Q is a nonfocal point, so that the projection Pp. of fl on 4>{M) is unique, 

and the extrinsic mean of Q is pE = <j)~^{Pli)- Let X = (1/n) denote the 

sample mean of Xj = 4){Xj). The extrinsic sample mean set is Cq„ = 4)^^{PX), 

where PX is the set of projection of X on 0(i\/). In a neighborhood of a nonfocal 
point such as /2, P{.) is smooth. So we can write 

(3.1) ^^i[P(l)-P(/l)] =y^(d^P)(A-/i) + op(l) = (d^P)(V^(l-M)) + op(l) 

where djxP is the differential (map) of P(-), which takes vectors in the tangent space 
of 3?*^ at fl to tangent vectors of (/>(M) at P(/i). Hence the left side is asymptotically 
normal. 
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For the case of regular submanifolds embedded in an Euclidean space by the 
inclusion map, a similar asymptotic distribution and a two-sample test were con- 
structed independently by Hendricks and Landsman [l >^] and, for more general 
manifolds, by Patrangcnaru [2()] and Bhattacharya and Patrangcnaru [8]. 

3.2. Application to the planar shape space of k-ads 

Consider a set of k points on the plane, e.g., k locations on a skull projected on a 
plane, not all points being the same. We will assume fc > 2 and refer to such a set as 
a k-ad (or a set of k landmarks). For convenience we will denote a k-ad by k complex 
numbers {zj = xj +iyj, 1 < .7 < fc), i.e., we will represent k-ads on a complex plane. 
By the shape of a k-ad z = (zi, Z2, ■ ■ ■ , Zk), we mean the equivalence class, or orbit of 
z under translation, rotation and scaling. To remove translation, one may substract 
(z) = ((z), (z), . . . , (z)) ((z) = (1/fc) X]j=i -^j) from z to get z — (z). Rotation of the 
k-ad by an angle and scaling (by a factor r > 0) are achieved by multiplying z— (z) 
by the complex number A = rexpi^. Hence one may represent the shape of the 
k-ad as the complex line passing through z— (z), namely, {A(z— (z)) : A G C\ {0}}. 
Thus the space of k-ads is the set of all complex lines on the (complex (fc — 1)- 
dimensional) hyperplane, H^~^ ~ {w <E \ {0}: X^i'^^i — 0}. Therefore the 
shape space I]| of planer k-ads has the structure of the complex projective space 
(j-<pfc-2. ^YiQ space of all complex lines through the origin in C*^"^. As in the case of 
it is convenient to represent the element of corresponding to a k-ad z 
by the curve 7(z) = [z] = {e'^((z — (z))/j|z— (z)||) : Q < 9 < 27r} on the unit sphere 
in H^-^ « C'^-^ 

If wc denote by u the quantity (z — (z) )/||z — (z) || , called the preshape of the shape 
of z, then another representation of is via the Veronese- Whitney embedding cj) 
into the space 5'(fc, C) of all fc x fc complex Hermitian matrices, ^(fc, C) is viewed as 
a (real) vector space with respect to the scaler field 3?. The embedding </) is given 

by 

0: E^^5(fc,C), 

cj){[z]) = uu* {u = (wi, . . .,uu)' e H''-\ \\u\\ = 1) 
(3.2) = {{uiUj))i<i^j<k- 

The shape of z, [z] = {e^^u: < 9 < 2tt} is the orbit of the vector u under rotation. 
Note that if vi,V2 e [z], then (t>{[vi\) = (/)([w2]) = 0((z- (z))/lk- (z)ll)- Define the 
extrinsic distance p on by that induced from this embedding, namely, 

/o q^ 2n 1 r n ii * *ii2 ■ ^ - (z) • w - (w) 

3.3) p {[z],[w]) ^\uu -vv \ ,u=- TTii,v^u i-Tu 

||z-(z)|| ||u;-(w)|| 

where for arbitrary k x k complex matrices A, B, 

(3.4) II A - Bf = \ajj, - bjj'f = Trace(^ - B){A - B)* 

is just the squared euclidcan distance between A and B regarded as elements of C'^^ 
(or, 3?^*^ ). Since the matrices uu* , vv* in (3.2) are Hermitian, one notes that the 
image 4'{'^2) of ^2 '"^ closed subset of C''" and the "conjugate-transpose" symbol 
* may be dropped from (3.4) in computing distances in (^(Sj). 
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Let Q be a probability measure on the shape space Sj, let [Xi], [X2], ■ ■ ■ , [Xn] 
be an i.i.d. sample from Q and let /i denote the mean vector of Q = Q o 
regarded as a probability measure on C'' (or, SR^*^ ). Note that /i belongs to the 
convex hull of M = 0(Sf) and in particular, is an element of H''~^. Let T be a 
(complex) orthogonal k x k matrix such that TjlT* = D = Diag(Ai, A2, . . . , Afe), 
where Ai < A2 < • • ■ < arc the eigenvalues of //. Then, writing v = Tu with u 
as in (3.3), 

k 

\\uu* - /if = \\vv* - = Ed^.f - ^j f + E l^^-^^f 

j=i j=i j=i j'=i j=i 

(3.5) =^A/ + l-2^A,b,p 

which is minimized (on ^(Sj)) by taking v = = (0, . . . , 0, 1)', i.e., u = T*ek, a 
unit eigenvector having the largest eigenvalue A^ of fx. It follows that the extrinsic 
mean say, of Q is unique if and only if the eigenspacc for the largest eigenvalue 
of p, is (complex) one-dimensional, and then /ig = [/i], fi{^ 0) £ the eigenspace of 
the largest eigenvalue of fi. 

From (3.5), the extrinsic variation of Q has the expression 

V = mxixi -pp*f 

= E||XiX* - /if + ||/i-/i^*||2 

(3.6) -2(1 -Afe) 

Therefore, we have the following consequence of Proposition 2.4 and Proposition 3.1. 

Corollary 3.2. Let fin denote an eigenvector of i^/n)'^^^^ XjXj* having the 
largest eigenvalue Afe„. (a) // the largest eigenvalue Afe of fl is simple, then the 
extrinsic sample mean [fin] a strongly consistent estimator of the extrinsic mean 
[n]. (b) The sample extrinsic variation, Vn ~ 2(1 — Afc„) is a strongly consistent 
estimator of the extrinsic variation, V ~ 2(1 — Afe). 

The distance p on in (3.3) can be expressed as 

(3.7) p^{[z], [w]) = \\uu* - w*f = 2(1 - \u*v\'^). 

This is the so-called full Procrustes distance for S]|. See Kent [2.'!], Dryden and 
Mardia [12] and Kendall et al. [21]. 

3.3. Asymptotic distribution of mean shape 

To get the asymptotic distribution of the sample extrinsic mean shape using (3.1), 
we embed M = E2 i^ito S{k, C), the space of all fc x fc complex self-adjoint matrices, 
via the map (j> in (3.2). We consider S{k,C) as a linear subspace of C*^ (over 3?) 
and as such a regular submanifold of C'^ embedded by the inclusion map, and 
inheriting the metric tensor: 



{A, B) = Re (Tracc(AB')) 
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The (real) dimension of S{k, C) is k^. An orthonormal basis for S{k, C) is given by 
{v^ : 1 < a<b < k} and {wj^ : 1 < a < 5 < A:}, defined as 



^(ea4 + e&e|j), a<b 
eaC*, a = b 

wl = ~{---j={eael - e^e^), a <b. 

where {ea : 1 < a < fc} is the standard canonical basis for 3?'^. 

We also take {v^ : 1 < a < & < fc} and {vol : 1 < a < 6 < fc} as the (constant) 
orthogonal frame for S'(fc,C). For any U e 0(fc) (C/JJ* = [/*[/ = /), {J7w°C/* : 1 < 
a <b < fc}, {Uw'^U* : 1 < a < 6 < fc} is also an orthogonal frame for S{k,C). 
Assume that the mean p, oi Q has its largest eigenvalue simple. To apply (3.1), 
we view d/iP : 5(fc,C) ^ Tp(^)(/)(I]^). Choose U e 0(fc) such that = D = 

Diag(Ai, . . . , Afc), Ai < • • • < Afe_i < A^ being the eigenvalues of fl. 

Choose the basis frame {Uv^U* , Uw^U*} for S{k, C). Then one can show that 



(3.8) d^PiUw^U*) = 
Write 



0, if 1 < a < 6 < fc, a = & = fc, 

^{Xk~ Ky'^Uv'jtU*, if 1 < a < fc,6 =: fc. 

'O, ifl<a<6<fc, 

{Xk- Xay^Uw^U*, ifl<a<fc,5=fc. 



l<a<b<k 

(3.9) + ^^(V^(l-/i),C/<t/*)t/<C/*, 

l<a<h<fc 

Since Xl^ = p.lk = 0, Ai = and U,i = alk, \a\ — 1/^/k. Thus 

{y^{X - fl), UvlU*) = {y^{X - fl), UwlU*) = 0. 



Therefore, 



Af,P{^{X - m)) 
fc-1 

= 5](V^(1 - m), C/i;^[/*)(Afc - Xa)-'UvlW 



a=2 
fc-1 



(3.10) + ^{V^iX - il), UwlU*){Xk - Xa)-^UwlU*. 



a=2 



From (3.10), we see that ^Jn{P{X)~ P{ji)) has an asymptotic Gaussian distribution 
on a subspace of S{k,<C) with asymptotic coordinates 

T„(A) = {{MX - ft), UvtU*)tl {MX - A), Uw'',U*)tl) 

with respect to the basis vector {(A^ — Xa)^^Uv^U* , {Xk — Xa)~^Uw1:U*}'^^zl. 

Writing S(/i) for the covariancc matrix of r„(/i), and assuming that it is non- 
singular, 

(3.11) r„(M)'i](/i)-ir„(^) X? 



2fc-4- 
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3.4- Two sample testing problems on S2 

Let Qi and Q2 be two probability measures on the shape space Sj, and let /ii and 
/i2 denote the means of Qi o cj)-'^ and Q2 o 4>~^ , respectively. Suppose [ 
and [j/i], . . . , [ym] arc i.i.d. random samples from Qi and Q2 respectively. Let Xi — 
(j){[xi]), Yi = 4>{[yi]) be their images onto (p{T,2) which are random samples from 
Qi o and Q2 o (/)~^, respectively. Suppose we are to test if the extrinsic means 
of Qi and Q2 are equal, i.e. 

Ho : Pfii = P^i2 

We assume that both fii and ^2 have simple largest eigenvalues. Then under Hq, 
the corresponding eigenvectors differ by a rotation. 

Choose fi G S{k,C) with the same projection as fii and fi2. Suppose fi — UKU* , 
where A = Diag(Ai < A2 < • • • < \k) are its eigenvalues and U = [J7i, C/2, . . . , Uk\ 
are the corresponding eigenvectors. Under Hq, P^i = P^2 = UkU^. From (3.10), 

A^P{X - fi) 
fc-i 



J2 V2Rc{U:XUk){Xk - Xa)-'Uv^^U* 

a=2 
k-1 

Y,V2lm{U:XUk){Xk - Kr^UwlU* 

a=2 

1 

Y,{\k^\a)-\KXUk)UaUt 



a=2 

k-1 



a=2 
k-1 



(3.12) + Y,{\k - \a)-HU*kXUa)UkU:, 



a=2 



d^p(y - /i) 



J2 V2Rc{U:YUk){Xk - XaT^UvlU* 

a=2 
k-1 

+ ^/2luY{U:YUk){Xk - XaY^UwlU* 

a=2 

k-1 

Y.{Xk ' XaY^KYUkWaUl 



a=2 
k-1 



(3.13) + ^(A, - Xa)-\UlYUa)UkU:. 

Define 



a=2 



Ue{U*+^X,Uk), if 1 < i < fc - 2, l<j<n, 

\lm{U*_,^^^XjUk), if /c - 1 < i < 2fc - 4, 1 < j < n, 

iRe{U*+^YjUk), if 1 < i < fc - 2, 1 < J < m, 

\lm(U*_f,_^_3YJUk), if fc - 1 < i < 2fc - 4, 1 < j < m, 



(3.14) T{^^) = -^T(m).„ S{^^) = -^5(a.)., 
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Under Hq, and S{fj.) have mean zero, and as n,m —> oo, 

(3.15) V^Tifi) ^ N{0, V^S{^i) iV(0, 1]2(/i)) 

where and S2(/^) arc the covarianccs of T{p),i and S{p),i, respectively. Sup- 

pose (n/ (m + ri)) p, (to/ (m + n)) q, for some p, q > 0; p + q = 1. Then 



\/r7+7n(r(/.) - 5(/.)) ^ iV2fe_4(0, -Ei(Ai) + -S2(m)). 

p q 

Thus assuming Ei(/^), S2(/^) and hence iEi(/x) + iS2(/i) to be nonsingular, 

(3.16) (n + to)(T(a.) - 5(m))'(^Si(a^) + h:2{^Ji))-\f{^Ji) - 5(m)) X^u-a- 

Note that the nonsingularity assumption for Si(/i) and S2(m) are satisfied if, for 
example, Qi and Q2 have nonzero absolutely continuous components with respect 
to the volume measure on (identified with the Ricmannian manifold CP*^"^). We 
can choose ^ to be any positive linear combination of /ii and /i2. Then under Hq, /i 
will have the same projection on ^(S^) as (Xi and 112- We may take n = p^i + q^2- 
In practice, since ni and /i2 are unknown, so is /i. Then we may estimate /i by the 
pooled sample mean /i = (nX + mY)/{m + n), Si(/i) and S2(a*) by their sample 
estimates Si(/i) and I]2(/i), where 

Ei(a*) = ^T{^£)T{^^)' - r(/x)T(/x)', E2(/x) = ^S{ij)S{ij)' - S{ij)S{y)'. 
Then the two-sample test statistic in (3.16) can be estimated by 

(3.17) Tnm = {f{p) ~ Sm'{-t,{^) + lE2(A))-'(r(A) - 5(A))- 

71 TO 

Given level a, we reject Hq if 

(3.18) r„„, > A'2V4(l-a)- 

The expression for r„,„ depends on the spectrum of /i through the orbit [C/fc(/i)] 
and the subspace spanned by {J72(A)i • ■ • i ^fc-i(A)}- If the population mean exists, 
\Uk (A)] is a consistent estimator of [Uk (^)] and by perturbation theory (see Dunford 
and Schwartz [i .!], p. 598), the projection on Span{t/2(//), . . . , Uk-i{(i)} converges 
to that on Span{J72(/^), ■ . ■ , C^*;-i(Ai)}. Thus from (3.16) and (3.17), T„m has an 
asymptotic <%'2\._4 distribution. Hence the test in (3.18) has asymptotic level a. 

To test if the populations have the same spread around their respective means, 
we use the test statistic in (2.14), which is 

(3.19) r„,„ = 2>™"^'" 



where Xkn and Xkm are the largest eigenvalues of X and Y , respectively. Under Hq, 
Tnra has asymptotic Normal distribution. 
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4. Intrinsic mean and variation 

Let (Af, g) be a d-dimensional connected complete Riemannian manifold, g being 
the Riemannian metric on AI. Let the distance p = dg he the geodesic distance 
under g. Let Q be a probability distribution on M with finite Frechet function, 



Definition 4.1. The Frechet mean (set) of Q under the distance dg is called its 
intrinsic mean (set). The Frechet variation of Q under dg is called its intrinsic 
variation. Let Xi, X2, ■ ■ ■ , Xn be i.i.d. observations on M with common distribution 
Q. The sample Frechet mean (set) is called the sample intrinsic mean (set) and the 
sample Frechet variation is called the sample intrinsic variation. 

Let us define a few technical terms related to Riemannian manifolds which we will 
use extensively in the subsequent sections. For details on Riemannian Manifolds, 
see DoCarmo [10], Gallot et al. [15] or Lee [25]. 

1. Geodesic: These are curves 7 on the manifold with zero acceleration. They are lo- 
cally length minimizing curves. For example, consider great circles on the sphere 
or straight lines in di'^. 

2. Exponential map: For p e M, v e T^M, we define exppW = 7(1), where 7 is a 
geodesic with 7(0) = p and 7(0) = v. 

3. Cut locus: Let 7 be a unit speed geodesic starting at p, 7(0) = p. Let to be the 
supremum of all t for which 7 is length minimizing on [0, t]. Then 7(^0) is called 
the cut point of p along 7. The cut locus of p, C{p), is the set of all cut points 
oi p along all geodesies. For example, C{p) = {—p} on S'^. 

4. Convex ball: A ball B is called convex if, for any p,q Cz B, a unique geodesic from 
p to g lies in B, which is also the shortest geodesic from p to q. For example, 
any ball of radius 7r/2 or less in S"^ is convex. 

5. Sectional Curvature: Recall the notion of Gaussian curvature of two dimensional 
surfaces. On a Riemannian manifold choose a pair of linearly independent 
vectors u,v € TpM. A two dimensional submanifold of M is swept out by the 
set of all geodesies starting at p and with initial velocities lying in the two- 
dimensional section tt spanned be u,v. The curvature of this submanifold is 
called the sectional curvature at p of the section tt. 

In all subsequent sections, we assume that M has all sectional curvatures bounded 
above by some C > 0. 

The next result, due to Kendall [-2], gives a sufficient condition for the existence 
of a unique local minimum of _F in a geodesic ball of reasonably wide radius. 

Proposition 4.1. If the support ofQ is contained in B{p., r) with r < 7r/(2-\/C') and 
B{p,r) nC(j)) = (j), then the Frechet function F of Q has a unique local minimum 
in B{p, r). 

Recall that (Karchar [19]; see also Theorem 2.1 in Bhattacharya and Patrange- 
naru [7]) if Q{C{p)) = OVp € M, then every local minimum ji of F satisfies 



(4.1) 




(4.2) 




where Q is the image of Q under the map exp^-'^ on M \ C(/i). 
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4.I. Asymptotic distribution of the sample intrinsic mean 

One can use Theorem 2.3 to get the asymptotic distribution of the sample intrinsic 
mean. For that we need to verify assumptions (i) to (vii). The next resuh gives 
sufficient conditions for those assumptions to hold. 

Theorem 4.2. Suppose the support of Q is contained in a geodesic ball B{p, r) 

with center p and radius r as in Proposition ^.1. Let (f> = exp^^ : B{p,r) > 

TpM{K Define h{x,y) = dl{(j)-'^x, (j)-^y); x,y £ 3?'*. Let {(Drh))f^^ and 

{(Dr^sh))f be the matrices of first and second order derivatives of y i— *■ h{x,y). 
Let Xj = (l){Xj)(j = I, . . . ,n), Xi, . . . , Xn being i.i.d. observations from Q. Let ii — 
/i/ being the point of local minimum of F in B{p, r). Let /i„ = (j){iini), fJ-ni be- 
ing the point of local minimum of Fn in B{p, r). Define A ~ E((DrDs/i(Xi, ^^i, 
E = Cov{(Drh{Xi, ij,)))f^-j^. //A and E are nonsingular, then 

(4.3) V^(/i„-M) ^iV(0,A-iSA-i). 

Proof. When Q is considered as a probability measure on the compact ball B(j), r) 
(as the underlying metric space), /x„ is a consistent estimator of ^, by Proposi- 
tion 2.2. In view of Proposition 4.1, as in the proof of Theorem 2.3 in Bhattacharya 
and Patrengenaru [n], Assumptions (i)-(vii) of Theorem 2.3 are verified. □ 

Remark 4.1. The nonsingularity of E in Theorem 4.2 is a mild condition which 
holds in particular if Q has a density (component) with respect to the volume 
measure. The nonsingularity of A is a more delicate matter in general, involving a 
detailed analysis involving curvature and Jacobi fields. These matters arc considered 
in detail in Bhattacharya and Bhattacharya [3] . 

Remark 4.2. Under the hypothesis of Proposition 4.1 (and Theorem 4.2), the 
point of local minimum /x/ of F in B(p, r) may not be the global minimizer of F on 
M. However, if one restricts attention to the closed ball B{p, r) as the underlying 
metric space of interest, this point of local minimum is the intrinsic mean (on 
B{p,r)). The advantage of Theorem 4.2 over the earlier result Theorem 2.3 in 
Bhattacharya and Patrengenaru [8] is that here one allows a much wider support 
of Q, namely, the radius r here is twice as large as that allowed in the earlier result. 
This is particularly important in two-sample problems as well as in problems of 
classification involving several populations. Also from a statistical point of view, 
the mean shape is perhaps better represented if defined as the Frechet mean over 
B{p, r) than over the whole of M, since Q{M \ B{p, r)) = and since B{p, r) is a 
connected Riemannian manifold inheriting the metric of M . 

Theorem 4.2 can be used to construct an asymptotic 1 — a confidence set for /i/ 
which is given by 

(4.4) {m, : n{^i,, - li)' {k-^tk-^)-\^i,, ~ ^i) < Xl(\ ~ a)} 

where (E, A) are consistent sample estimates of (E, A) and A'Kl — a) is the upper 
(1 — Oif^ quantile of the chi-squared distribution with d degrees of freedom. 

Also we can perform a nonparametric test to test if two distributions Qi a-nd Qi 
have the same intrinsic mean /i/. Let [i = <f>{^i). Let Xi, . . . , Xn and Yi, . . . , Ym be 
i.i.d. observations from Qi and Q2, respectively. Let Q„ and Qm be the empirical 
distributions and /x„i and /im2 be the corresponding sample mean coordinates. We 
want to test Hq : jiu = 1^121 = Hi, say, against Hi : 7^ ^21, where /ii/ and fj.21 
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are the true intrinsic means of Qi and Q27 respectively. Then the test statistic used 
is 

(4.5) Tnrn = {n + m)(^„i - /-im2)'S"^(/l,ii - jU,„2), 

(4.6) E = (m + n) ( iAr^SiA^^ + -A^^EaA^M , 

\n m J 

(AijEi) and (A2,E2) being the parameters in the asymptotic distribution of 
^/n{^Jinl — 1^) and Vm(Mm2 — m), respectively, as defined in Theorem 4.2. (Ai,Si) 
and (A2, E2) are consistent sample estimates. In case n, m — > 00 such that n/ (m + 
n) — > 6*, < 6* < 1, then under the hypothesis of Theorem 4.2, assuming Hq to be 
true, 

(4.8) V^rT^(//„i - ^Jim2) ^ ^d(0, ^Ar^EiAr' + Y^A^^EaA^i). 

So Tnm Xj. We reject iJo at asymptotic level 1 — a if Tnm > '^di^ ~ 

We conclude with the test for the equality of intrinsic variations Vi, V2 of Qi 
and Q2- Under the hypothesis of Theorem 4.2, the test for Hq : Vi = V2, against 
Hi : Vi ^ V2, is provided by the asymptotically Normal statistic T„,„ in (2.14), as 
described at the end of Section 2. 



5. Examples 

In this section, wc record the results of two-sample tests in two examples. 

Example 1 (Schizophrenic Children). In this example from Bookstein [9], 13 land- 
marks are recorded on a midsagittal two-dimensional slice from a Magnetic Res- 
onance brain scan of each of 14 schizophrenic children and 14 normal children. 
Figures 1(a), (b) show the preshapes of the landmarks for the patient and normal 
samples along with the respective sample extrinsic mean preshapes. The sample 
preshapes are rotated appropriately as to minimize their Euclidean distance from 
the mean preshape. Figure 2 shows the preshapes of the normal and the patient 
sample extrinsic means along with the pooled sample mean. 

The values of the two-sample test statistics (3.17), (4.5) for testing equality of 
the mean shapes, along with the p-values are as follows. 

Extrinsic: T,„„ = 95.5476, p-value = P{X^2 > 95.5476) = 3.8 x 10"". 
Intrinsic: T„„ = 95.4587, p-value = PiX^^ > 95.4587) = 3.97 x 10"". 
The extrinsic sample variations for patient and normal samples are 0.0107 and 
0.0093, respectively. The value of the two-sample test statistic (3.19) for testing 
equality of extrinsic variations is 0.9461, and the p-value is 0.3441. The value of the 
likelihood ratio test statistic, using the so-called offset normal shape distribution 
(Drydcn and Mardia [12], pp. 145-146) is -2 log A = 43.124, p-valuc = PiX^2 > 
43.124) ~ 0.005. The corresponding values of Goodall's F-statistic and Bookstein's 
Monte Carlo test (Drydcn and Mardia [12], pp. 145-146) are -^22,572 = 1.89, p-value 
= P(^22,572 > 1.89) = 0.01. The p-value for Bookstein's test = 0.04. 

Example 2 (Gorilla Skulls). To test the difference in the shapes of skulls of male 
and female gorillas, eight landmarks are chosen on the midline plane of the skulls of 
29 male and 30 female gorillas. We use the data of O'Higgins and Dryden reproduced 
in Dryden and Mardia ([12], pp. 317-318). The statistics (3.17) and (4.5) yield the 
following values: 
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14 Normal children 13 landmarks, along with mean shape 



-0.5 -0.4 -0.3 -0.2 -0.1 0.1 0.2 0.3 0.4 

(a) 



14 schizophrenic children 13 landmarks, along with their mean shape 



-0.5 -0.4 -0.3 -0.2 -0.1 0.1 0.2 0.3 0.4 

(b) 

Fig 1. (a) and (b) show 13 landmarks for 14 normal and 14 schizophrenic patients, respectively, 
along with the mean shapes, * correspond to the mean landmarks; Ic shows the sample extrinsic 
means for the 2 groups along with the pooled sample mean. 



Extrinsic: r„™ = 392.6, p-valuc = P{Xi2 > 392.6) < 10"^^ 
Intrinsic: T„,„ = 391.63, p-value = PiX^2 > 391.63) < lO^^^. 

The extrinsic sample variations for male and female samples are 0.005 and 0.0038, 
respectively. The value of the two-sample test statistic (3.19) for testing equality of 
extrinsic variations is 0.923, and the p-value is 0.356. A parametric F-test (Dryden 
and Mardia [12], p. 154) yields F = 26.47, p-value = P(i^i2,46 > 26.47) = 0.0001. 
A parametric (Normal) model for Bookstein coordinates leads to the Hotelling's 
test (Dryden and Mardia [12], pp. 170-172) yields the p-value 0.0001. 
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Fig 2. The sample extrinsic means for the 2 groups along with the pooled sample mean, corre- 
sponding to Figure 1, 
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